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SEPARATING SUBADDITIVE EUCLIDEAN FUNCTIONALS 


ALAN FRIEZE AND WESLEY PEGDEN 


Abstract. If we are given n random points in the hypercube [0,1]^, then the 
minimum length of a Traveling Salesperson Tour through the points, the min¬ 
imum length of a spanning tree, and the minimum length of a matching, etc., 

d-i 

are known to be asymptotically (3n d a.s., where p is an absolute constant 
in each case. We prove separation results for these constants. In particular, 
concerning the constants ,^mst’ '^mM’ ^TF ^he asymptotic 

formulas for the minimum length TSP, spanning tree, matching, and 2-factor, 
respectively, we prove that /5^grp < ^ /^TSP’ ^TF ^ ^TSP 

for all d > 2. We also asymptotically separate the TSP from its linear pro¬ 
gramming relaxation in this setting. Our results have some computational 
relevance, showing that a certain natural class of simple algorithms cannot 
solve the random Euclidean TSP efficiently. 


1. Introduction 

Beardwood, Halton, and Hammersley [3] studied the length of a Traveling Salesper¬ 
son Tour through random points in Euclidean space. In particular, if xi, X 2 ,... is a 
random sequence of points in [0,1]^ and Ah = {xi,..., x^}, their results imply that 
there is an absolute constant ^xsp such that the length TSP (Ah) of a minimum 
length tour through Ah satisfies 

(1) TSP(A’„) ~ a.s. 

This result has many extensions; for example, we know that identical asymp¬ 
totic formulas hold for the the cases of the minimum length of a spanning tree 
MST(A’„)[3], and the minimum length of a matching MM(A’„) [23]. Steele [25] 
provided a general framework which enables fast assertion of identical asymptotic 
formulas for these and other suitable problems. For example, we will see in Sec¬ 
tion 2 that his results imply that the length TF(A’„) of a minimum length 2-factor 
admits the same asymptotic characterization. 

A major remaining problem in this area is to obtain analytic results regarding 
the constants /3 in such formulas. In particular, the best rigorous bounds on such 
constants are generally very weak, with known results for d = 2 given in Table 1. 
In particular, the bounds on /^Up were not improved since 1959, until the paper 
[29] of Steinerberger improved the lower bound by ss .0036... and the upper 
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lower 

upper 

/^TSP 

.62866 [29] 

.92116... [3, 29] 

i^MST 

.60082 [2] 

72 « -707 [12] 

2/5mm 

.5 [5] 

.92116 

Table 1. 

Bounds on 

constants for d = 


bound by « 10 ® . On the other hand, there was some success as d grows large, as 
Bertsimas and Van Ryzen [5] showed that, asymptotically in d, 


( 2 ) 



and conjectured that /Slfgp 


' 2 ^ as well. 


It seems that it has been overlooked that local geometric arguments are sufficient 
to prove the separation of constants for many natural examples of Euclidean func¬ 
tionals. In particular, in the present paper, we will show that /3 mst < /^tspj 
/3xf < PtsP’ ‘^Pmm < /^TSP These are the first asymptotic separa¬ 

tions for Euclidean functionals where the Euclidean metric is playing an essential 
role: the only previous separation was shown (by Bern [4]; see also [18]) for the min¬ 
imum length rectilinear Steiner tree vs. the minimum rectilinear length spanning 
tree, which is equivalent to asymptotically distinguishing Steiner trees from trees 
in the Li norm. (The rectilinear Steiner tree is also the only case where the asymp¬ 
totic worst-case length is known exactly [7].) Einally, we will also asymptotically 
separate the TSP from its linear programming relaxation. 

We begin by considering the degrees of vertices in the minimum spanning trees 
among n random points. Steele, Shepp, and Eddy [28] showed that the number 
Ak{Xn) of vertices of degree k satisfies 

Ak{Xn) ~ Oik,dn 

for constants ak,d-, and proved that a{l,d) > 0. Note that we must have ak,d = 0 
when k > T{d), where r(c?) is the kissing number of d dimensional space (6 in the 
case d = 2). Indeed, we must have Ofc.d = 0 whenever k > T'{d), where T'{d) 
denotes a strict kissing number of d, which we define as the maximum K such 
that there exists e > 0 such that there is, in d dimensions, a conhguration of K 
disjoint spheres of radius 1 e each tangent to a common unit sphere. (Note that 
T'(d) < T{d), and in particular, t'( 2) = 5.) We prove: 

Theorem 1.1. a{k,d) >0 if k < T'{d). 


Considering Euclidean functionals MSTfc(T) (with corresponding constants 
defined as the minimum length of a spanning tree of X whose vertices all have degree 
< fc, we will then get separation as follows: 


Theorem 1.2. We have that 

(3) /?TSP = -^MST2 > /^MSTa > ’ ’ ’ > = /^MST 
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for all d. 


Thus, the MST^ constants are as diverse as are allowed by the simple geometric 
constraint of T'{d). 

Still, there are only finitely many constants /SmsTj, each d; while we can draw 
trees with very large degrees, large degrees (relative to d) are not useful for minimum 
spanning trees in Euclidean space. In contrast to this scenario, let us recall that 
a 2-factor is a disjoint set of cycles covering a given set of points. We will see in 
Section 2 that the length of the minimum 2-factor is indeed a subadditive Euclidean 
functional, and thus this length satisfies TE(T„) ~ /3^pn~3~ for some constant /3^p. 
Moreover, if TEg(X) is the minimum length of a 2-factor through X whose cycles all 
have length > g, then we will see that TEg is also a subadditive linear functional, 
so that we have TEg(A’„) ~ /^ TFg ^^ ■ Naturally, we must have / 3 ^p = < 

/3tf 4 — /^TFs < • • • • In analogy to the high-degree vertices in a tree, we can of 
course draw 2-factors with small cycles, but it is not clear a priori whether small 
cycles will be asymptotically essential to optimum 2-factors in random point sets. 
The following theorem shows that they are: 

Theorem 1.3. /3TFg is a monotone increasing sequence /3^p^ < /?tf 4 < /^tFs < 


On the other hand, we prove that 2-factors with long (but constant) girth require¬ 
ments prodnce close approximations to the TSP: 

Theorem 1.4. lim /d^p = /d^gp- 


With a bit more work, our method for proving Theorem 1.3 will also allow us to 
deduce the following: 

Theorem 1.5. 2/d^j^ < /3xsp- 

We note in contrast to Theorem 1.3 that in the independent case where the edge 
lengths Xe,e G (^ 2 ^) are independent uniform [0,1] random variables, Erieze [11] 
showed that with probability 1 — o(l), the weight of the minimal 2-factor is asymp¬ 
totically equal to the minimum length of a tour. 

We continue by mentioning a natural generalization of MM(T„). Given a fixed 
graph H on k vertices, an i/-factor of a set of points S' is a set of edges isomorphic 
to [|X|/fcJ vertex disjoint copies of H. As a subadditive Euclidean functional, the 
minimum length 7JF(A’„) of an H factor of Xn satisfies 

H¥{Xn) ^ . 

We pose the following conjecture: 

Conjecture 1.6. Given Hi,H 2 and d > 2, we have that fdjj^ unless Hi 

and H 2 are each isomorphic to a disjoint union of copies of some graph H^. In 
particular, ldjj_^ fd%_^ if Hi, H 2 are connected and non-isomorphic. 
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We prove at least the following, showing diversity in the constants even for fixed 
edge density: 

Theorem 1.7. For any fixed d >2 and rational r > 1, there are infinitely many 
distinct constants fijj over connected graphs H with edge density = r. 

Our final separation result concerns the linear programming relaxation of the TSP. 
The TSP through a set of points can be given as the following integer program, on 
variables a:{y} {i,j GV,i< j) indicating the presence of an edge between vertices 
i and j in the tour, where gives the cost of the edge {*, j}: 


min ^ C{^J}a;{y} 

{i.iicy 

subject to 


(Vi) J^xfij}=2 

jVi 

{V0^ SC V) E X{ij} < |5'| - 1 

{ljIcs 

(Vi <j€V) e {0,1} 


The linear programming relaxation of the above integer program replaces the final 
constraint with the requirements that 0 < x^ijy < 1 for each It is often 

referred to as the Held-Karp relaxation, of the TSP, but its origins go back to the 
the paper of Dantzig, Fulkerson and Johnson [9]. It should be noted that, although 
this LP has an exponential number of constraints, it can be solved in polynomial 
time, e.g., using the ellipsoid algorithm [14]. 

We denote by HK(T„) the value of a solution to the Held-Karp relaxation on Xn- 
Of course, HK(A’„) < TSP(A’„). The Held-Karp bound on the TSP is generally 
considered to be a good bound which is algorithmically useful on ‘typical instances’ 
(see [6, 15, 19, 24, 31]), and, with this motivation, Goemans and Bertsimas [13] 
showed that HK(A’„) ~ /3hkR^ for some constant /3hk, and asked whether /3hk = 
/3tsp- Experimental evidence has suggested that any gap between the constants 
would be less than 1%. [20] 

Theorem 1.8. For all d>2, < /^xsp- 

Our separation results have implications for the practical problem of solving the 
Euclidean TSP. Branch and bound algorithms are a standard approach to solving 
NP-hard problems, in which a bounding estimate is used to prune an exhaustive 
search of the solution space. There has been a great deal of success solving real- 
world instances of the TSP with branch-and-bound augmented with sophisticated 
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techniques based on cutting planes for the TSP polytope (see, for example Apple- 
gate, Bixby, Chvatal and Cook [1]). 

Our results show, however, show that several natural approximators (2-factor, twice 
a matching, the Held-Karp bound, etc) are asymptotically distinct from the TSP. 
This will be algorithmically relevant as follows: 

Theorem 1.9. Suppose that we use branch and bound to solve the TSP on Xn, 
using TFg or HK as a lower bound. Then, w.h.p, the algorithm runs in time 

log® n) ^ 

In particular, this gives a rigorous explanation for the observation (see [22], for 
example) that branch-and-bound heuristics using the Assignment Problem as a 
bounding estimate (even weaker than the 2-factor) perform poorly on random Eu¬ 
clidean instances, and indicates that the success of the Held-Karp bound in branch 
and bound algorithms will be limited for sufficiently large Euclidean point sets. 

Remark 1.10. Asymptotic formulas are available for subadditive Euclidean func¬ 
tionals in more general settings. If xi,X 2 ,‘-‘ € are independent identically 
distributed random variables with bounded support, then the length L{xi, ..., Xn) 
of the functional on the points Xn satisfies 

L{xi,...,Xn)/n^ ^ Pi f{x)^dx, 

where / is the absolutely continuous part of the distribution of the x^s and /3f is a 
constant depending only on d and L (see [3, 25]). Note that this gives an asymptotic 
formula for L(xi,... ,Xn) unless the right hand side is zero. The latter case will 
happen if the x^s lie exclusively on some m-dimensional manifold embedded in M®* 
where m < d, but the BHH theorem also has a suitable extension to this setting 
[8], allowing asymptotic formulas involving n ™ . Our results are all immediately 
valid in these more general settings, however: as the constants Pi depend only on 
L and d or m (in particular, not on the distribution or, in the second case, the 
particular manifold), it is enough study the constants in the case of points which 
are uniformly distributed in the hypercube. 

2. SUBADDITIVE EUCLIDEAN FUNCTIONALS 

Steele defined a Euclidean functional as a real valued function L on finite subsets 
of which is invariant under translation, and scales as L{aX) = aL{X). It is 
nearly monotone with respect to addition of points if 

(4) L{XUY)> L{X)-oin"^) for n = [A]. 

It has finite variance if, fixing n, we have 

(5) Var(L(A„)) < oo 

(in particular, if it is bounded for fixed n) and it is subadditive if, for a random 
set of n points from [0,t]®*, it satisfies 

Liyn)< L{Sa,nyn)+Ctm’^-^ 
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for some absolute constant C, where here {S'a} {a G [m]^^) is a decomposition of 
[0, into subcubes of side length u = t/m. 

Steele proved: 

Theorem 2.1 (Steele [25]). If L is a subadditive Euclidean functional on of 
finite variance, xi,X 2 ,--- is a random sequence of points from [0,1]^^, and Xn = 
{xi,X 2 , ■ ■ ■ ,Xn}, then there is an absolute constant s.t. 

(6) L{X„)/n^ -)> a.s. 

In particular, we have a.s. that either T(A’„) = o{n ^^) (if = 0) or that 

(7) L{X^)^fin^. 

This can thus be used to easily give the existence of the simple asymptotic for¬ 
mulas for the functionals TFg(X), MSTfe(X), and IIE{X) by showing that these 
functionals are subadditive. We note that the constant /3f cannot be zero in any 
of these cases and thus that (6) implies (7), since for any 1 < z < n, 

E( min dist(a;i, x)) = n(n“3), 

X^Xn 

X^Xi 

giving a lower bound of Q{n~3 ~) for each functional by Linearity of Expectation. 

Proposition 2.2. TFg(X), MSTfe(X), and HF{X) are subadditive Euclidean func¬ 
tionals. 


Before writing a proof, we note that for the definition of the 2-factor functionals 
TFg(X), we can only require that the 2-factors whose length we minimize cover all 
the points when there are at least max(g, 3) points. Similarly, the HF{X) functional 
is required just to cover at least n — |i7| -I- 1 points. 


Proof. We begin by noting that for each of these functionals, we can assert an upper 
bound Cn~3~ for some constant C, even over worst-case arrangements of n points 
in [0,1]*^. The analogous statement for the TSP was proved by Toth [30] and by 
Few [10], and implies these bounds for the functionals considered here. Indeed, a 
tour through n points itself gives a tree of max-degree 2 (after deleting one edge), 
and is a 2-factor subject to any constant girth restriction. For H factors, a tour can 
be divided into paths of length jiJj (except for < \II\ remaining vertices) which can 
then be completed to instances of H' f) H by adding edges. Each added edge has a 
cost bounded by the length of the path it lies in and so this construction increases 
the total cost by at most a factor equal to the number of edges in H. 

Subadditivity of TFg(X), and IIF{X) is now a consequence of the fact that a union 
of 2-factors (subject to restrictions on the cycle length, perhaps) or iL-factors is 
again a 2-factor (subject to the same restrictions) or an H factor, respectively. 
In particular, the subadditive error term for these functions comes just from the 
fact that points may be uncovered in some of the subcubes Sa, for the exceptional 
reasons noted above. Since there are at most {g — F)ml^ or such uncovered 
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points, however, the error is suitably bounded by the minimum cost factor on a 
worst-case arrangement of the remaining points. 

Subadditivity of MSTfc(X) {k > 2) is similar: after finding minimal spanning trees 
of max-degree k in each subcube Sa, we must join together these trees into a single 
tree. We choose 2 leaves of each subcube’s tree and denote one red and the other 
blue. We let 01,0:27 • ■ • 7<am'‘ denote a path through the decomposition {5 'q}, so 
that the subcubes Sa- and are adjacent. For each i < m'^, we join the red 

leaf of the tree in Sa^ to the blue leaf in the tree of S'q.+i. The result is a spanning 
tree of the whole set of points with the same maximum degree and with extra cost 
at most 2^/dum'^ = 2y/dtm‘^~^. □ 

3. Separating asymptotic constants 

In the following we will use the simplest application of the Azuma-Hoeffding martin¬ 
gale tail inequality: It is often referred to as McDiarmid’s inequality [21]. Suppose 
that we have a random variable Z = Z{Xi, X 2 , . .., Xn) where Xi, X 2 , ■ ■ ■, Xn^ are 
independent. Further, suppose that changing one Xi can only change Z by at most 
c in absolute value. Then for any t > 0, 

(8) Pr(|Z-EZ|>t)<2exp|-^|. 

We will also use the following inequality, applicable under the same conditions, 
when E Z is not large enough. 

\ ocEi Z j c 

a) 

Our method to distinguish constants is based on achieving constant factor improve¬ 
ments to the values of functions via local changes. Given e, ZI € M and a finite set 
of points S C and a universe X, we say that T C A is an (e, U)-copy of S 
if there is a bijection / between T and a point set S' congruent to S such that 
I It — /(x) 11 < £ for all T € T, and such that T is at distance > D from X\T. Here 
we will further assume that ||t — y|| > e ior x ^ y G S. 

For our purposes, it will be convenient notationally to work with n random points 
from [0,t]'^ where t = in place of n random points Xn from [0, 1]'^. At 

the end, we will scale our results by a factor in order to get what is claimed 

above. 

Observation 3.1. Given any finite point set S, any £ > 0, and any D, a.s 
contains at least j^n {e, D)-eopies of S, for some constant > 0. 

Proof of Observation 3.1. Let Z denote the number of (£, ZI)-copies of S in 
We divide [0, tY into n/ {3DY subcubes Ci, (72,..., of side 3D. Then let (7' C Ci 
be a centrally placed subcube of side D. Now choose a set S' congruent to S 
somewhere inside (7[ and let Bi, B 2 ,..., Bg, s = jS”! be the collection of balls of 
radius £, centered at each point of S'. The with probability at least a = ,D > 0, 
each Bi contains exactly one point of and there are no other points of in 
Cl. Thus EZ > fin where fi = a/{3DY. Now changing the position of one point 
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in yn changes the number of (e, £))-copies of S by at most two and so we can use 
McDiarmid’s inequality [21] to show that Z > ^EZ a.s. □ 

To use this to prove Theorem 1.1, we will need just a bit more. 

Observation 3.2. If Y C and x lies in the interior of the convex hull of Y, 
then when D is sufficiently large, any point at distance > D is closer to some point 
ofY than to X. □ 

lIvo,vi,... ,Vk are vectors in with pairwise negative dot-product, then vi,... ,Vk 
lie in the half-space uq • a; < 0, and the projections oi vi,... ,Vk onto the hyperplane 
Vq-x = 0 have pairwise negative dot-products. This gives the following, by induction 
on d: 

Observation 3.3. If vi,... ,Vd+i € are vectors with negative pairwise dot- 
products, then 0 is a positive linear combination of the Vi’s. □ 


This allows us to prove: 

Lemma 3.4. If d1 < k < rfd), then there exists a set of points C 
consisting of a single point at the origin, surrounded by a set of k points 
on the unit sphere centered at the origin and separated pairwise by at least some 
e' > 0 more than unit distance, such that does not lie in open half-space whose 
boundary passes through the origin. 


Proof. We first observe that the definition of r' already gives us a set with 
the desired properties, except that it may all lie in some open half-space through 
the origin. In this case, however, we can delete a point and replace it with the 
point xh on the unit sphere opposite the half-space H, and furthest away from the 
halfspace. We do this repeatedly and note that because the above exchange of points 
only happens when all points are on one side of a half-space H', xr remains as the 
unique point which is in the open half-space opposite to H. Furthermore, doing 
this repeatedly, we can achieve either a set 5”^*^ with all the desired properties, or 
can find after at most k steps a set of points on the sphere separated pairwise 
by at least > 0 more than unit distance, and whose pairwise dot products as 
vectors in are all negative. But then Observation 3.3 and k > d-\-l implies that 
the points cannot all lie in the interior of some half-space whose boundary passes 
through the origin. □ 


We are now ready to prove Theorem 1.1. 

Proof of Theorem 1.1. Given fc > 2, we choose any d' < d such that d' -I- 1 < 

k < T'{d'). 

We apply Lemma 3.4 with k, d' to get a set C Observe first that the origin 
must lie in the convex hull X of the set 5"^^^ given by Lemma 3.4; otherwise, there 
would be a supporting half-space H oi X not containing the origin, and S ^ would 
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Figure 1. A configuration for forcing degree 2 in 2-dimensions. 


lie in the open half-space through the origin which is parallel to H, a contradiction. 
Now we take x {0}'^“'^ , and the origin is still in the convex hull of 

Now, letting denote a unit simplex centered at the origin (with d -|- 1 points), 
we let 

U = S^’^^U IJ {(1.5)p-f .1-A"^}. 

So U is a set of 1 -I- fc -I- (d -I- 1)A: points. (Figure 1 shows U for the case d = 2, fc = 2; 
note that in this case, d' = 1.) 

We now let denote an (e, D) copy of U, for sufficiently small e > 0 and 
sufficiently large D. Observe that since the origin is in the convex hull of the 
k small copies of the d-simplex in U ensure that the origin is in the interior of the 
convex hall of U, and thus also in the interior of for sufficiently small e. 

Observe that (for large D) the distance between any pair of points in an Ue^o is 
less than the minimum distance between and \ In particular, if T 

denotes the minimum length spanning tree on 3^„, the subgraph T[C4 _d] induced 
by the points in C4 ,d must be connected (and so a tree), or we could exchange 
a long edge for a short edge. Moreover, the minimum length spanning tree on T 
must restrict to a minimum length spanning tree on Ug^D, and by construction, the 
point of corresponding to the origin point in U has degree k in the MST on 
U. Finally, no points in A’n \ C4 ,d can be adjacent to the center of the star when D 
is sufficiently large, by Observation 3.2. Thus Observation 3.1 gives that ak,d > 0 
for d -I- 1 < fc < t' (d). 

Finally, ai^d > 0 is an immediate consequence of a^^d >0. □ 

Indeed, Theorem 1.2 follows immediately as well: 


Proof of Theorem 1.2. Suppose 2 < k < T'{d), and T is a minimum spanning tree 
of 3^n subject to the restriction that the maximum degree is < k. By Observation 
3.1 we have that there are Cn (e, D) copies of the set U from the previous proof, for 
some constant C, and from the argument above we see that each such copy Si will 
induce a (connected subtree) which will have maximum degree at most k in 

an instance of MST^. Replacing each r[5'i] by the optimum {k + l)-star produces 
a spanning tree of maximum degree fc -I- 1, whose length is less by at least some 
constant C'n. Rescaling by t gives that the length difference is at least C'n^^. □ 

Remark 3.5. The same argument allows us to separate /3 mst from Psteiner where 
the latter corresponds to the minimum length Steiner tree. We just need to use 
{e, D) copies of an equilateral triangle. We remark that adding the Steiner points 
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corresponding to the Fermat points of the copies will reduce the tree length. The 
details can be left to the reader. 


We turn our attention now to 2-factors. We begin with two very simple geometric 
lemmas: 

Lemma 3.6. Suppose that points p, q, r, s satisfy 

lb-9ll.lk-s|| > ^ and \\q - r\ \ < d, 
where S i.e A is sufficiently large with respect to S. 

Let 0{x;y,z) denote the angle between the line segments xy and xz. If 
uiax{6{p;q, s),6{s;p,r)} > 

then 

\\p-s\\<\\p-q\\ + \\r-s\\+6-^. 


Proof. We have 

lb - s|| < lb - q\\cos6{p; g , s) -I- 5 -b lb - s|| cos0(s;p, r). 

Now use cos a; < 1 — x‘^/3 for x < 1. □ 


Lemma 3.7. Suppose that points pi, qi, ri, Si, z = 1,2 satisfy 

(10) Ib» - 9»||, Ibi - Sill > ^ fori = 1,2 

and also that qi,ri,q 2 ,r 2 are contained in a ball of radius 6. Then there is a 
matching on {pi,P 2 , si, S 2 } whose total length is at most 

( 11 ) Ibi - <?ill + Ibi - Sill + Ib 2 - 92II + Ib 2 - S2II + 45 - lA. 


Proof. Without loss of generality we let the qi, ri be within distance 5 of the origin, 
and then let 0{x, y) denote the angle between x and y via the origin that is less than 
or equal to tt. There are three possible pairings of the points P = {pi,p 2 , si, S 2 }, 
and for at least one such pairing, 9{x,y) < ^tt for one of the pairs. 

Let us take {x, y} and {zc, z} to be the pairs in such a pairing of P, with 0{x, y) < 
^TT. We let T denote the triangle with vertices x,y,Q, let a,b,c denote the side- 
lengths, where a is length of the side opposite 0, and s denote the semi-perimeter 
{a + b + c)l2. Now a < (6^ -b and in fact 

6-bc — a>6-bc — (6^-b = (6 -b c) ^1 — ^1 — ^ ^ ^ 

be 1 . r7 1 1 a 

> - > -min{5, c| > - A. 

- 6-bc - 2 ^ ^ - 2 

Thus we find a pairing of P for which the total length is at most |bi|| + Ib 2 || + 
Ibill + Ib 2 || — ^A, and we will be done after applying the triangle inequality four 
times and using the fact that |bi||, |bi|| < 5 for z = 1,2. □ 
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Figure 2. When not all pairs are nearly straight the old 2- factor 
(left) can be shortened to a new one (right). (The dashed circle of 
radius e encloses 5 + 1=4 points.) 

Proof of Theorem 1.3. Let Fg+i be a minimum length 2-factor in whose 
cycles all have length >5 + 1. We let Ue,D C denote any set of g points of 
radius £ and at distance D from yn\Ue,D- Note that Lemma 3.1 implies that there 
are a linear number of copies of such sets. We now dehne as a collection of 

three instances t/i, C/ 2 , 1^3 of Ue,D, centered at the vertices of an equilateral triangle 
of sidelength 2D, and lying at distance A from 3/n \ 14 ,d,a; we will take D large 
relative to e and A large relative to D. 

We call a multiset of edges a 2-matching if every vertex is incident with exactly 
2 (not necessarily distinct) edges in the multiset. This is the same as a 2-factor, 
except that it can contain “ 2 -cycles”, which consist of a single edge included twice. 

We will begin by showing how to give a constant-factor shortening of the 2-factor 
Fg+i to a 2-matching F, without being careful to avoid creating cycles of length 
shorter than 5 . In particular, we prove the following lemma: 

Lemma 3.8. There is an absolute constant 6 such that for suitable choices of 
£ <C 1/ <C A, any instance ofV = 14,d,A allows a modification F o/Fg+i so that 

(1) F is a 2-matching; 

(2) F has weight at least 5 less than the length o/Fg+i; 

(3) Cycles of F lying entirely in V have length > g; 

(4) F is a local modification o/Fg+i, in the sense that any edges ofFg^i disjoint 
from V are still present in F. 

Again, Lemma 3.1 implies that there are a linear number of instances of 14 ,d,A in 
In particular, this lemma would be sufficient to argue that /^TFg < /^TEg+i, 
except that F may not have girth 5 . 


Proof of Lemma 3.8. For Ui = in V, there are (at least 2) edges in Fg+i from 
\ Ui to Ui, since 5 + 1 > 5 = |{7i|. We can pair these edges so that each pair 
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Figure 3. An instance of V^^d.a (here for g = 2, d = 2). When 
all pairs of edges entering/leaving U^’s are nearly straight, we must 
have at least 2 pairs of edges entering/leaving V, as shown here. 


lies on a common cycle of Fg+i, and so that the two edges in a pair are joined in 
Fg_|_i by a path through (possibly just 1 point of) Ui. Similarly, we can pair edges 
between V and \ V. (Some pairs for V may also be pairs for a Ui, others may 
not.) 

Now, by choosing D large relative to e, we can assume that each pair of edges for 
a Ui is nearly straight, in the sense that the angle between the endpoints of the 
edges in jVn via any point in U^^d is close to tt; otherwise, we can modify Fg+i 
by including all edges of some (/-cycle through Ui, and shortcutting each pair of 
edges between y„ \ Ui and Ui with a single edge between the endpoints in yn\Ui. 
(Figure 2.) The result has length smaller by a constant S = n(D^/^), see Lemma 
3.6. To ensure condition (3) for F, we must now also shortcut all remaining pairs 
of edges between V and \ V, delete any edges in V\Ui, and then add (/-cycles 
to the remaining Lf, ’s. (This step adds length which can be made arbitrarily small 
by decreasing e.) 

We may also assume that each Ui has only a single pair of edges. Otherwise, if 
there are two different pairs, we delete the edges in the two pairs, use Lemma 3.7 
to add a pair of edges among the 4 outside endpoints of the pairs of total weight 
which is less than the total weight of the pairs by a constant (note that we may 
have created a 2-cycle if one of these edges was already present, which is why F is 
only a 2-matching), shortcut all other remaining pairs between V and jVn, delete 
all edges within V, and add (/-cycles to each Ui. For sufficiently small e, we get a 
constant length improvement. 

Thus we may assume that each Ui in V has a single pair, and that the pair for each 
Ui in V is nearly straight. The crucial point is that this implies that there must 
be at least two pairs of edges joining V to \ since, e.g., edges joining Ui to 
U 2 and Ui to C /3 would not be nearly straight. Therefore at least one of the C/^’s 
has no edges to the other UiS. (See Figure 3.) We conclude, as in the previous 
paragraph, by deleting the edges in the two pairs, using Lemma 3.7 to find a pair 
of edges among the 4 outside endpoints of the pairs of total weight which is less 
than the total weight of the pairs by a constant, shortcutting all other remaining 
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pairs between V and deleting all edges within V, and adding (/-cycles to each 

U,. □ 


We must now address unintentional problems of girth. (Notice that, in shortcutting 
edges, we may have left behind short cycles—in particular, any 2 -cycles must be 
eliminated.) To this end, we say that V = 14,d.A is e-surrounded if the set Afy of 
points of \ y within distance 3A of V has the properties that: (1) each x € J\fv 
lies within distance £ of the sphere S of radius 2A centered at the center of V, 
and (2) each x G S lies within £ of Afy- (Essentially, Afv is an approximation 
to an £-net on S, which surrounds V). Lemma 3.1 implies that there are a linear 
number of £-surrounded E’s, and additionally, a linear number of £-surrounded sets 
V satisfying the requirements in the previous paragraph (each Ui has a single-pair 
of edges to the rest of y„, etc.). 

We now show that if V is £-surrounded, then there is an constant C'g,^, which can 
be made arbitrarily small by decreasing £, such that there is a 2-factor F' such 
that: 

(A) F' has total weight w{F') < w{Fg+i) + Cg^e, 

(B) every cycle in F' is still of length > 5 -I- 1, 

(C) All edges in F' incident with V either lie in V or intersect Afy- 

To produce F' from Eg+i, we consider each edge e = {u, u} from V to A’n\ (Ay UE) 
which does not intersect Afy, and 

(1) Locate a point x in A/y within distance £ of a point w on the edge e. Let 
C = {x = xi,X2, ■ ■ - Xk, Xk+i = xi) be the cycle of Eg+i that contains x. If 
u = Xi for some i, then we choose the cycle orientation so that v = Xi_i. 

( 2 ) Add the edges { 11 , 3 : 1 }, {xk,v} to the 2 -factor and delete the edges e and 
{xi,Xk}. 


This ensures (C) and the change in cost for this one substitution is 

llxi - u|| -I- ||a;fc - w|| - llxi - Xfcll - ||u - w|| - ||m - w|| 

<11x1 - u|| -I- IIti - w|| - llu - wll 

<2||xi - w\\. 

Thus dealing with all edges from 14, d,A to A’n\14,D,A increases the cost by at most 
I2ge, since there are 3g points in V and hence at most 6g edges from 14, d, A to 
yn\V,,DA- 

After this, any cycle in F' but not in Fg + l must contain an edge added in Step 
(2). But either u,v ^ {xi, ■ •., x^j, in which case the length of this cycle is at least 
k+2 > (/-I-3, or else u = Xi,v = and this cycle is xi, X 2 ,..., Xi_iXfcXfe_i • • • x^xi 
and so has length k > g + 1. 


We are now prepared to find a 2-factor Fg whose weight is smaller than Eg+i by 
a constant factor. For some small constant c, we have that there are at least cn 
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instances of e-surrounded V = T4,_d,a’s. We take these instances as Vi, V 2 ,..., in 
any order, and beginning with F = ^g+l and for each i = 1 , 2 ,..., we 

(i) Find F' for Vi as above (with weight increase which we make arbitrarily 
small) 

(ii) Apply Lemma 3.8 to shorten F' at Vi to Fq with a constant weight improve¬ 
ment 

(iii) At an arbitrarily small cost, modify Fq to a 2 -factor Fq which has girth g, 
by merging cycles intersecting the net Nvn and set F = Fq (explanation is 
below). 

In particular, to carry out Step (iii), note that any cycle C of length < g in Fq 
includes a point x of Ny, and we can merge C with the cycle through a point 
y within 2s of x, at an additional cost of < 2e: We join x and y, delete edges 
{x,x'} and {y,y'} incident with each in the previous 2 -factor and replace them by 
{x, y} , {x', y'} at a cost of 

\\x - y\ \ + \ \x' - y'W - \ \x - x'W - \ \y - y'W < 2\\x - y\\. 

After applying Steps (i)-(iii) for each F € V, the result is a 2-factor Fg = F of 
girth y, whose total weight is smaller than the total weight of Fg+l by a constant 
factor. □ 


The proof of the counterpoint Theorem 1.4 will be given in Section 4. For now 
we consider matchings. In fact. Theorem 1.5 can be viewed as a consequence of 
Theorem 1.8, via Proposition 5 of [13]. However, we also give a short self-contained 
proof. 


Proof of Theorem 1.5. We define the Euclidean functional 2MM(A) as the 
minimum length union of two matchings on X. Note that we make no requirement 
of disjointness and that we trivially have that 2MM(A) = 2 • MM(A) for all X. 
On the other hand, a TSP through X can be viewed as a (near)-union of two 
matchings (alternating edges around the tour, leaving one vertex unmatched if n 
is odd). Our aim will be to give a constant factor improvement to the union of a 
pair of matchings given by the TSP, to show that 2MM(3^„) is asymptotically less 
than TSP(jy„). To this end, we let Mi and M 2 denote a pair of matchings derived 
from the minimum length TSP. 

We let Ue^D denote a set of two points separated by distance at most £ and at 
distance at least D from all other points of Vn, and let Ve,D,F denote a collection 
of 5 instances C/i,..., 1/5 of C/£,_d, centered at the vertices of a regular pentagon of 
sidelength 2D, such that all other points of are at distance > F from this set. 
As before. Lemma 3.1 gives that there are a linear number of instances of 14, d,f 
for any fixed F, D, and £ > 0. Moreover, as before, if we have a linear number of 
instances L4 ,d in which a pair of edges of a matching leaves 14 ,d and is not nearly 
straight, then we can make a constant improvement to the matching, by joining 
the two points of and shortcutting the outside endpoints of the edges leaving 
Ue,D with a single edge. 
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Since Mi and M 2 are disjoint, the pigeonhole principle gives that for some s G {1,2} 
and at least three of the Ui’s in any the pair of points in Ui is omitted from 

Mg. In particular, we may assume without loss of generality that we have a linear 
number of for which the set / of indices i for which the points in Ui are 

unmatched in Mi has cardinality |/| >3. Moreover, from the previous paragraph, 
there must be a linear number of such I 4 ,d,f’s which also have the property that 
the pair edges leaving the Ui, i G I is nearly straight. In particular, as the point 
sets Ui {i G I) are not nearly collinear, we must have as in the previous proof that 
there are (at least) 2 pairs of edges entering and leaving t4,D,F- We conclude by 
applying Lemma 3.7 (with 2£, say) to get a constant factor improvement a linear 
number of times. □ 

Proof of Theorem 1.7. It suffices to show that for fixed r > 1, there are connected 
graphs H with r- \V{H)\ edges for which the constant f3fj is arbitrarily large, which 
we show by demonstrating that can be arbitrarily large even just over trees T. 
To this end, we let be the tree on /c + I vertices which has k leaves. 

Given any large constant u = t/m for some integer m, we decompose the [0,^]"^ 
cube with subcubes of side u. Now the number of points in each subcube is 
binomially distributed with mean Let a point in be good if the subcube 
Sa that it lies in has at least (1 — e)u'^ members of and the total number of 
points in the < 3^^ subcubes that touch Sa contain at most (1 + e)(3'u)‘^ members 
of where £ = Assuming that u is sufficiently large, the Chernoff bounds 
imply that a member of is good with probability at least 1 — e/2. Thus the 
expected number of good points in A’n is at least (1 — e/2)n. Now the Chernoff 
bounds can be used to show that the number of members of in any subcube is 
a.s. O(logn) and therefore, changing one point only changes the number of good 
points by O(logn) a.s. A fairly simple modification of McDiarmid’s inequality now 
implies that a.s. (1 — e)n of the members of A’n are good. 

Since « n/{k + 1) points must have degree fc in a Tfc factor of we have that 
there are at least n/{2k) good points which have degree k. Now let k = 2{‘iuY. 
Then a.s. a factor has length at least ^ • m > “. 

Rescaling the cube by a factor of t gives that the minimum Tk factor has 

length at least ^un~d~, and here | is an arbitrarily large constant. □ 

Proof of Theorem 1.8. We begin with some general observations regarding the 
shortest TSP through Euclidean point sets: 

Observation 3.9. Suppose that S^^u is an (£, D) copy of a fixed set S for fixed e 
and sufficiently large D, and that at least 2 pairs of edges of a shortest TSP tour 
C join Se,D to V \ Se^D- Then the pairs are nearly straight (i.e., the angle for each 
pair is arbitrarily close to tt as £ —>■ 0, and k,D ^ 00 ). 

(Here the edges of C joining Se,D to H \ Se,D are paired such that the endpoints of 
the edges in a pair which are inside are joined by a portion of L lying entirely 
in Ss,D-) 
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Proof. Otherwise, we shortcut the edge pair which is not nearly straight to obtain 
a constant improvement (which, for a hxed angle, can be made large by increasing 
D). The tour portion between one of the other edge pairs is modified to cover any 
vertices of which are now missed by the tour, at an increased cost which does 
not depend on I?. □ 

Observation 3.10. Suppose that is an (e, D) copy of any fixed set S for fixed 
e and sufficiently large D. Then there are at most 2 pairs of edges in a shortest 
TSP tour which join Sg^n to V \ Sg^o- 


Proof. Let C denote a shortest TSP tour, and suppose there are three pairs (ei, 62 ) 
( 715 / 2 ) and ( 31 ,( 72 ) of edges in £ between Sg^D and V = V \ Sg^n- We let 
Xi,X 2 ,yi,y 2 , Zi, Z 2 denote the endpoints of the edges ei, 62 ,/i,/ 2 , 3 i, 32 , respec¬ 
tively, which lie in V', and we suppose, without loss of generality, that the pairs 
2 ^ 1 , 31 , 32 , 22 , and 21 , 0 : 2 , respectively, are joined by paths in V" n £, for V" = 
V \ {xi,X2,yi,g2,zi,Z2}. 

We now modify £ as follows. 

(1) We remove the edges ei, 62 , /i, / 2 , 3 i, 32 • 

(2) We add new edges between the pairs (^ 2 , 32 ) and ( 31 , 2 : 1 ). 

(3) We add a path which travels from a;i to the set Sg^D (visiting every vertex 
of the set) and then to 2:2 • 

Observe that the result is a new TSP tour; it remains to estimate the change in cost. 
The path P added in part 3 has Euclidean length £{P) at most £{ei — 32 ) -I- Cs,g,D, 
where we are viewing the edges as vectors from their point in V" to their point in 
Sg D, and where Cs g is a constant depending only on S.s. (For example, we can 
take Cs,g = TSP{Sg[D) + diam(S').) 

Similarly, the edge (^ 2 , 32 ) has length at most £{62 — f 2 ) + Cs,g,D, and the edge 
( 31 , 2 i) has length at most £{fi — 31 ) -I- Cs,g,D- Applying the triangle inequality to 
the three lengths immediately gives that our new tour has length at most TSP(A’„)-|- 
SCs^g^D- In fact, we should be hoping to do better: If £{62 — f 2 ) is within a constant 
of ^( 62 ) -l-^(/ 2 ) as D grows large, then the points 0 : 2,32 are constrained to be nearly 
antipodal about center c of the set Sg^o (with angle tending to tt as —)■ 00 ). 
Similarly, we have that £{fi — 31 ) is far from £{fi) + £{gi) unless 31 , 2 i are nearly 
antipodal. 

Thus if we have not achieved a contradiction by shortening the tour, then, ^ 2,32 
are nearly antipodal, and 31,21 are nearly antipodal. Observation 3.9 means that 
the pairs {a;i,a; 2 }, { 31 , 32 }, and { 21 , 2 : 2 } are nearly antipodal as well. Thus, in 
particular, xi and 2 i are not nearly antipodal, and so we can produce a shortening 
of the original TSP tour by instead: 

(1) Removing the edges ei, 62 ,/i,/ 2 , 3 i, 32 ; 

(2) Adding new edges between the pairs (a;i, 2 i) and ( 0 : 2 , 22 ); 
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Figure 4. The set Such sets are good for the HK-bound, 
but bad for the tour. (Thick lines have weight 1, while thin lines 
have weight ^.) 


(3) Adding a path which travels from yi to the set Ss,d (visiting every vertex 
of the set) and then to 1/2 ■ □ 


We now consider the d = 2 case of Theorem 1.8. We let S'* be a set consisting of 
k equally spaced points on a unit circle centered at the origin, 2 k equally spaced 
points at the radius 4 circle centered at the origin, and the points (2,0) and (—2, 0) 
(See Figure 4. The particular ratios 2k : k and 4:2:1 are chosen just to make a 
clean hgure.). We will argue that if c is sufficiently small and D, k are sufficiently 
large, each instance of an (e, D) copy of S* allows us to locally modify an instance 
of the TSP so that it is still a solution to the HK linear program, but is shorter by 
some additive constant. 

To this end, let £ be some shortest tour, and S'*^, be an (c, D)-copy of S'*. Observa¬ 
tion 3.10 implies that £ enters (and leaves) 5*^, either once or twice, for sufficiently 
large D. 


Case 1: The tour enters once. We let xi,X 2 denote the vertices in V' = 
V \ D which are adjacent to vertices in S*^). We let £° be the portion of £ 
between xi and X 2 . Then we have that 

(12) i{£°) > dist(xi, S*£)) -f dist(S*£), 0 : 2 ) -f IOtt + & K — o(l), 

where we are using o(l) to denote a function which tends toOaser—>-0 , k ^ 00 , 
and H —>■ 00 simultaneously, and K is an absolute constant (in fact, K can be 2). 
To see this lower bound, observe that the tour must cover the outer circle (ss Stt), 
the inner circle (« 27r), and must cross the gap between the inner and outer circles 
twice (2-3 = 6). Finally, the tour must also spend more (bounded below by some 
constant K) to cover one of the two “gap” vertices in S'*^,. To see that the tour 
can not cover both gap vertices in gap crossings while crossing the gap only twice, 
let 01,02 denote the first and last vertices of £° on the inner circle. Either 01,02 
lie at the two ends of the inner circle close to the gap vertices, in which case the 
tour spends an additive constant K more than 27r — o(l) to cover the entire inner 
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circle, or, say, ai lies at distance 1 + K from the gap vertex to which it is joined, 
incurring an extra cost K again. 

We now modify the portion C° so that the result is still a solution to the Held- 
Karp LP, but is shorter by some additive constant. We let yi,y2 and Zi,Z2 be 
pairs of points on the outer circle which are closest to the gap vertices gi and 1/27 
respectively, and similarly let ai , 02 and bi , 62 be points of the inner circle which are 
closest to the first and second gap vertex, respectively. We join all pairs among each 
of the triples yi, 2/2, ffi, ai, 02, ffi, 21,2:2,32, 61, &2,52 by edges of weight 5 (Figure 4 ). 
Next, we let ai, 027 013 be a consecutive triple of points on the outer circle which is 
as close as possible to Xi (but disjoint from the set {yijy2, Zi, Z2}), and / 3 i,/? 27/33 
be a consecutive triple of points on the outer circle which is as close as possible to 
X2 (but disjoint from the set {yi,y2, zi, Z2,ai,a2j as}). We join all pairs among 
each of the triples ai, 027 as and Pi, h, Ps with edges of weight We join a2 and 
P2 to Xi and X27 respectively, by edges of weight 1 . Finally, using edges of weight 1 , 
we join all consecutive pairs of points on each circle which were not already joined 
(by edges of weight j). The result for is shown on the right-hand side of Figure 
4 . As k, D grow large and e grows small, the total cost of this is 

dist(xi, S'g £,) -I- dist(5'*£i,a;2) -f IOtt - 1 - 6 - 1 - o(l), 

thus it suffices to check that what we have given is indeed a valid instance of the 
Held-Karp LP. We carry out this verification below, after describing the second 
case of the construction. 


Case 2 : The tour enters twice. We let x\,x\ and x\,x‘^ denote the two pairs 
of entry/exit points in F' = y \ We let be the portion of L between x\ 

and x\, C? be the portion between x\ and x^, and let C° = U Then we have 
that 

( 13 ) i{C°) > dist{x),Slo) + IOtt -b 6 -b a: - o(l), 

i=l,2 

i=l,2 

where K is again an absolute constant (and where again, in fact, K can be 2 ). 
Again, IOtt 4-6 is needed to cover both circles, and transition to the inner circle and 
back. If both and visit the inner circle this gives an extra cost of « 6 for the 
transitions, so we assume that is the only portion to visit the inner circle. But 
the argument from the previous case shows that cannot cover the entire inner 
circle and visit both gap vertices without incurring an additive constant extra cost. 

We now modify the portion C° so that the result is still a solution to the Held-Karp 
LP, as follows. We still let 31,32 and Zi,Z2 be pairs of points on the outer circle 
which are closest to the gap vertices 31 and 32, respectively, and similarly let oi, 02 
and bi , 62 be points of the inner circle which are closest to the first and second gap 
vertex, respectively. We join all pairs among each of the triples 31,32, 3 i, Oi, 02,31, 
zi,Z2,g2, bi,b2,g2 by edges of weight Next, we let a\,a2,al be PI, PI, PI be 
consecutive triples of points on the outer circle which are close to the points x\ and 
X2, respectively. (All named points must be distinct) We join all pairs among each 
of the triples al,a2,al and P\,P2,Ps with edges of weight ^ for i = 1 , 2 . We join 
al and P2 to x\ and x^, respectively, for i = 1 , 2 , by edges of weight 1 . Finally, 
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Figure 5. An example of the set with two passes of a tour. 
(Thick lines have weight 1, while thin lines have weight |.) 


using edges of weight 1, we join all consecutive pairs of points on each circle which 
were not already joined (by edges of weight ^). The result for is shown on the 
right-hand side of Figure 5. As k, D grow large and c grows small, the total cost of 
this is 

dist(x*, S'g £)) -I- IOtt -1-6-1- o(l), 

1 = 1,2 

i = l,2 

and so we have improved the length by an additive constant. 


Feasibility of the solutions. We now check that making many local modifications 
according to the cases above does not disturb the property that £ is a feasible 
instance of the Held-Karp LP. It is immediate that all degree-weights ^{ij} 
are 2; it remains to check the condition that 

(14) (V0 C S' C V) E ^{ij} ^ l'5'l 1- 

{*J}CS 

In other words, the total weight of edges in any proper nonempty subset is at least 
one less than the number of vertices. 

Since the degree-weights are 2 at every vertex, we can show (14) by showing that 
any proper nonempty subset S has the property that edges of total weight at least 
2 leave the subset; i.e., that 

(15) (V0 C S C V) E ^{ij} — 

ies 

If this fails, there is a cut in the graph of weight < 2. First we consider the possibility 
that the cut includes an edge of weight Such edges only occur in triangles, and 
a minimum cut in a graph cannot contain exactly one edge of any triangle. But 
this already implies that no cut of weight < 2 in our graph can include any edges of 
weight since deleting even all the edges of any one triangle does not disconnect 
our graph, and since 4 such edges, or 2 such edges plus an edge of weight 1, already 
gives weight 2. 
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But now we are done: since no minimum cut of weight < 2 can include an edge 
of weight i (and all other edges have weight 1), it suffices to note that there is no 
single cut edge in our modified graph. This completes the proof for the case d = 2. 


The case d > 2. We cannot use exactly the same point set in higher dimensions. 
The only trouble with the previous argument is the lower bound on the tour length. 
In particular, if an edge enters a set S'* at a sharp angle to the 2-D plane containing 
S*, then it may join to an inner circle vertex or gap vertex at negligible (or zero) 
extra cost over entering at the nearest possible point, and this is not accounted for 
in (12) or (13). Our goal now is to create a set out of many copies of S*, in a way 
that allows us to be certain that some copy of S* must be incident only with edges 
nearly parallel to its containing plane, in any optimal tour. 

We will let ^ be an {e,D) copy of a certain elaborate set The set 
consists of £ copies of S*. Each copy of S* lies in a 2-dimensional hyperplane normal 
to some unit vector v, and we orient the copies of S'* so that these hyperplanes are 
tangent to the {d— l)-sphere Sr of radius R. Moreover, we ensure that the centers 
Cl, ... ,c^ lie on Sr, and are roughly “evenly spaced” in the sense that as £ grows 
large, we can ensure that for any given unit vector x € Si, we have that x lies 
arbitrarily close to j^Ci for one of the Cj’s. 

We now appeal to the following fact: 

Lemma 3.11. If X is a subset of the unit (d — l)-sphere Si in of cardinality 
n, then TSP(X) = o(n). 


Proof. We can rescale the sphere and embed it in the hypercube [0,1]*^, and then 
appeal to the fact (see e.g., [10, 26]) that the worst-case length of a tour through 
n points in [0, l]'^ is < CdU^sr for some constant Cd- (It is also easy to obtain a 

d-2 

bound for the sphere, but this is unnecessary for us.) □ 

In particular, this lemma implies that if we take R ^ £ ^ k, then typical edges 
joining pairs of instances of (approximate) S*’s in the set S^^ are of length o{R) 
(where the asymptotics are as i? —> oo). Furthermore, Observation 3.10 implies 
that the tour enters S'^^ at most twice, and also that it enters each S'* at most 
twice. In particular, for sufficient choices R ^ £ ^ k, there will necessarily be at 
least one set X, which is an approximate S* in the set S*’^ ^ such that 

(1) All edges entering/leaving X go to other points of and 

(2) All such edges have length o{R). 

In particular, the edges entering and leaving X will be nearly parallel to the 2- 
dimensional hyperplane in which X (approximately) lies. This is because 

(a) the portion of the sphere within o(i?) of X will lie within a small angular 
distance of X, and moreover. 
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(b) the portion of the c-neighborhood of the sphere within distance o{R) of X but 
also at distance at least some large constant from X is also within a small 
angular distance of X. (Observe that points in r\^ £^re a large constant 
distance from X by our choice of i? ^ 

But this is now sufficient to ensure that we can modify the tour in X (according to 
Case 1 or 2 from above) to obtain a constant additive improvement. □ 


4. The 2-factor limit 


Here we prove Theorem 1.4: limg_>,oo = /3 tsp- continue to work with 

as above. We divide [0,^]*^ into subcubes Sa,a G [m]‘^ of sidelength 

L, for some sufficiently large constant L > 0. 

With each cube Sa we associate the 2‘^ quadrants Qa,j,j = 1,2, ...,2 "^, whose 
origin is the center Sa of Sa- We call the quadrant Qaj trivial if the quadrant 
intersects [0, t]'^ in a unit cube (in which case Sa is one of the 2*^ corner cubes in the 
decomposition). Then for a non-negative integer r, we let Qa,j,r denote the cubes 
in Qa,j whose centers are at distance at most rL from for convenience, we call 
Qa,j,r trivial (resp. nontrivial) whenever Qa,j is, regardless of the choice of r. 

If Qa,j,r Q [0,^]“^ is nontrivial and Yaj^r is the number of points of that are in 
Qa,j,r then Yaj^r is a binomial random variable with mean 

advL < EYa.i.r < Pd{rLY 

for some constants ad, Pd > 0. Note that, away from the boundary cubes of the 
decomposition of [0, t]'^ we can use (rL)*^ in place of rL for the lower bound, but in 
the worst-case, we have to reduce the exponent. We can therefore write 

(16) Pr{Ya,j,r = 0) < 

for some jd > 0- 


Next let Vr denote the number of subcubes Sa for which there exists j, r such that 
Qa,j is nontrivial, Qa,j,r C [0,t]‘^, and Yaj,r = 0. Then 

(17) Eu,. < 


for some > 0. We deduce from the above that 

2 

(18) = 0 for r > ro = — (7j^(logn -I- (ilog2). 

1j 


Now v-r is determined by n independent choices for the points in Changing 
one point changes Vr by at most Sdr^^ for some 5d > 0. Applying McDiarmid’s 
inequality we see that if t > 0 and r < rg then 




(19) 


Pr(ur >Ei'r + t) < exp 
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If Ei/r > then (17) implies that r < ^ ^ logn for L sufficiently 

large. It follows from (19) with t = Ei^r that 

(20) i^r < a.s. if 'Ei^r ^ . 

When EZ < we use (9) with a = j(Ev^) and c = = log*^^^^ n to 

obtain 

(21) Vy. < a.s. if 


Now suppose that Ci, ( 72 ,..., Cm are the cycles of a minimum cost 2-factor, where 
\Ci\ > g lor i = 1 , 2 , ...,M. Suppose first there exist i,j such that there exist 
Sp 5 X G Ci and Sq 5 y G Cj such that ||sp — Sg|| < L^- Suppose that (x,x') is 
an edge of Ci and that (y,y') is an edge of Cj. Then ||a: — ?/|| < (L + 2d^/^)L and 
\W ~ v'W Cl l|a^^ ~ a;|| + llx — j/|| + ||y' — y\\. It follows that if we delete the edges 
{x,x'),{y,y') from Ci,Cj and add the edges (x,y), {x',y') then we create a single 
cycle out of the vertices of Ci U Cj at a cost of at most 2||x' — ?/||. By repeating 
this where possible, we obtain a new set of cycles (7(, ..., C'j^, such that for two 

distinct cycles C', Cj the set of subcubes visited by C' have centers that are distance 
at least LC from the centers of the set of subcubes visited by (7). Furthermore, the 
increase in cost associated with this merging is at most 

( 22 ) 2(L^2pL^ 

We continue merging cycles. For r = L + 1...,, ro we try to merge cycles C, C 
for which there is a subcube Si containing a point of C whose center is within rL 
of the center of a subcube that contains a point of (77 The cost of making these 
merges can be bounded by 
ro 

(23) 2 < 3n(2L)'^e-^‘'-^" 

r—L+1 

for L sufficiently large. 

This is because, when we merge two cycles via subcubes at distance rL we are using 
one of at most i^r subcubes. Further, for each such subcube there are at most r‘^ 
other subcubes at distance r. 

We argue next that after all of these merges, there can be only one cycle. Suppose 
that there are two cycles (7, C and let x G C,x' G C be as close as possible. 
Suppose that x G Sa and that x' G Sb where ||so — s&|| > "Co- If this happens then 
we can find a Qa,j,ro or a Qb,j,ro that is empty, contradiction. 

It follows from this and (22), (23) that with L = that after scaling to [0,1]"^ 
we find that for g sufficiently large, 

and this completes the proof of Theorem 1.3. □ 
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5. Branch and Bound Algorithms 

In this section we prove Theorem 1.9. Branch-and-bound is a pruning process, 
which can be used to search for an optimum TSP tour. Branch-and-bound as we 
consider here depends on 3 choices: 

( 1 ) A choice of heuristic to hud (not always optimal) TSP tours; 

(2) A choice of lower bound for the TSP (such as the 2-factor, HK bound, etc.); 

(3) A branching strategy (giving a branch-and-bound tree). 

As an example, we will consider the case where we use some heuristic for the TSP, 
the 2 -factor as a lower bound, and use a branching strategy based on the 2 -factor 
as well. 

Given our point-set An, we begin by letting B be the value of the tour found 
by our TSP heuristic. We let b^ be the length of the shortest 2-factor in A„. 
Here x represents the root of the branch-and-bound tree, which we will construct 
iteratively; A^, is the set of all TSP tours in A„. Unless b^ > B, we do not know 
that B is an optimal tour, so we branch in the following way: we choose some cycle 
C in the 2-factor we have found, and, for the edges ei,...,efc of C, generate k 
children xi,... ,Xk oi x, letting 

Ixi — 0) 1x2 ~ {®1 }) ^X2 = {Sl j 62}) ■ ■ ■ 
d^Xi {^ 1 } 7 ^X2 {^2} ; ^X2 {^ 3 } 7 • ■ ■ 

These are sets of required inclusions and exclusions, respectively. In particular, 
for any vertex v of our tree, A^, is the set of tours containing all edges in ly and 
avoiding all edges in Oy. (For the root, we had Ix = Ox = 0.) Thus, in this 
example, the vertex x^ corresponds to the set of TSP tours which do contain ei 
but do not contain 62 . For each Xi, we use our TSP heuristic to find a tour, with 
the additional constraints that the tour includes all edges in Ix^ and excludes those 
in Oxi ■ Whenever we find a TSP tour shorter than the current value of B, we 
update B. We also, for each Xi, let bx^ be the minimum-length 2-factor subject to 
the constraints IxijOxi- For any bx^ for which bx^ is at least B, we know that no 
shorter tour than B exists subject to Ixi, Ox^, and the tree is pruned below Xi, so 
that Xi becomes a leaf of the pruned branch-and-bound tree. For other vertices, we 
continue to branch as above, by adding further constraints to kill some other cycle 
of the minimum 2 -factors found. 

This process terminates when the set L of leaves of the pruned branch-and-bound 
tree satisfies v € L => by > B; such a tree corresponds to a certificate that the 
best TSP tour found so far by our heuristic is indeed optimum. 

In general, as a branching strategy, we allow any method to produce, given an input 
weighted graph, a rooted tree (the branch-and-bound tree) labeled with sets ly, Oy 
of edges from the graph such that: 

(1) When u is a child of u, ly 0 ly and Oy 0 0„. 

(2) If the children of u are Vi,... ,Vk, then we have A„ = (Jfci ^vi ■ 
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(3) The leaves of the (unpruned) branch-and-bound tree satisfy |A„| = 1. 

Following any such branching strategy and pruning when by > B will eventually 
lead to a proof that an optimum tour has been found (assuming a reasonable TSP 
heuristic and lower bound), since, in the worst case, the heuristic and lower bound 
will match on the leaf v for which A^, contains just the optimum tour. For branch- 
and-bound to be efficient, we would hope that all but polynomially many vertices 
of the branch-and-bound tree can be pruned because of inequalities by > B. 

We can restate Theorem 1.9 more precisely as follows 

Theorem 1.9 (restated). For any TSP heuristic, any branching strategy, and a 
lower bound heuristic which is TFg or HK, the pruned branch-and-bound tree will 
have leaves a.s. 


In particular, in our proof of Theorem 1.9, we will make the most optimistic assump¬ 
tion regarding the TSP heuristic: we will simply assume that it always returns an 
optimum tour {B will always be the true value of the minimum TSP). Theorem 1.9 
asserts that even in this case, there can be no polynomially-sized branch-and-bound 
tree which certifies optimality w.h.p. 

One natural strategy to prove that L is large would be to show that each A„ (v G L) 
is small; then, A = A^ would give that L must be large. To show that A„ is 
small, one can hope to argue that to have LB(A’„|/„, 0„) > TSP(A’„), either /„ or 
Oy must be large, severely restricting the number of tours in A„. The problem is 
that while large does restrict the size of A„ considerably, having a large Oy can 
be a rather weak restriction. 


We will thus modify this basic approach by paying attention to a special set of tours 

A. Given the point set Xn, we will consider the division of [ 0 ,1]'^ into s = 

1 

boxes of sidelength ^ '°s " ^ where K is at least some sufficiently large constant. 
Bi,B 2 ,. ■ ■ ,Bs denote these boxes, taken in some order such that consecutive terms 
are adjacent (i.e., sharing a {d— 1)-dimensional face). Note that 


lx 


y\ < '/d 


AT log n 


a x,y lie in the same box. 


We consider Xn = {xi, X2, ..., x„}, and, for for each 2 < j < s — 1, we let 
x],x|,x|,x| denote the four points Xi G Xn H Bj of smallest index (this par¬ 
ticular choice is arbitrary, and is just for definiteness). We also choose points 
xf,xf G Xn D Bi and x\,x‘l G XnO Bs, again by simply choosing points of mini¬ 
mum index. Letting 

(24) 1= {xl,xj I 1 < i < s}U{xl,xj I 1 < i < s}) 

the points in I can be viewed as preselected “interface points” between the boxes 
Bj . In particular, we let A denote the set of TSP tours in Xn with the properties 
that, in the tour. 
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( 1 ) xf is joined to xf by a path lying entirely in Bi; 

( 2 ) for 1 < j < s — 1 , Xj and are adjacent; 

(3) for 2<j<s — 1, xj is joined to Xj by a path lying entirely in Bj; 

(4) xl is joined to Xg by a path lying entirely in B^; 

(5) for s > j > 2, and are adjacent; and 

( 6 ) for s — 1 > j > 2, a;| is joined to xj by a path lying entirely in Bj. 

Note that tours in A use only edges of length < 2\/d ^ ^ ^°g ” ^ , Instead of using the 
full strength of the condition A = A^, we will use the weaker (but apparently 

more useful) condition 

A = U A„ 

v&L 

where A^, = A H A„. Intuitively, we are focusing our attention on a restricted set of 
tours (chosen such that the value of LB at relevant leaves v £ L with nonempty A„ 
will be close to its typical length), and this restricted set A has the property that 
the allowable set of edges at each vertex is now small enough that having a large 
excluded set really will force A„ = A n A„ to be small. 

Our proof will require us to analyze the performance of LB conditioned on the 
exclusions and exclusions Thus we begin by proving that some simple 

operations preserve the property of being an feasible instance of the Held-Karp LP. 

Lemma 5.1. Suppose we are given a feasible instance of the Held-Karp LP on 
a set X of cardinality n. Then the following operations will all result in feasible 
instances of the Held-Karp LP: 

(1) Downweight all edges by a factor of {1— ^), add a new pair of vertices y, z 
joined by an edge of weight 1, and join each of y, z to all vertices in X by 
edges of weight ^. 

(2) For k < ^, downweight all edges by a factor of (I — ^), add k new vertices 
yi,... ,yk, and join each yi to all vertices in X by edges of weight ^. 


Proof. It is immediate for both operations that the (weighted) degree of every 
vertex is 2; we need to check that 

(25) y 0 CSCV S(S) := S-.sUI-i- 

eCS 

For Operation I, observe first that if 0 C S' C X, we have S(S) < (I — |^)(|<S'| — 1) 
and S{X) = |X| — I. For 0 C S C X, we have 

S(SU M) < (I - ^)(|S| -1) + |S|^ = |S| + ^ -1 < |SU Ml -1. 

and 

S{SU{y,z}) < (1 - ^)(|S| - I) + |S|^ + 1 = |S| + ^ < |SU {2/,4| - I. 
And S{X U M) = S{X U {z}) = |A|. 
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Finally, for Operation 2, we have that S{S) < (1 — ■|^)(l>5'| — 1) if 0 C S' C X, and 
S{X) = (1 - ^)(|X|) = |X| - fc. If y c {yi, y 2 ,..., 2/4 and 0 c s c X then 


(26) S(S u r) < (1 - ^)(|S| - is^x) + 41 • 41 • m 


1^1 

= 41 


(24l-fc)4l 

Al 


-1 


s^x 


1^1 
1 - 


Al 


< 41 + (41 - l|F|<fc) -- Is^X (^1 - 


41 


and we see that {Y C {yi,... ,yi.} V S ^ X) 


5(sur) < 41 + 41 - 1. □ 


To know that the lower bound LB = HK performs well at leaves v & L such that 
A„ 7 ^ 0, we will also want to patch several smaller solutions to the Held-Karp LP 
into a single global solution using the same edges which tours in A use to cross the 
B^s: 

Lemma 5.2. Suppose we are given feasible solutions to the Held-Karp LP on dis¬ 
joint sets Xi,X 2 , ■. ■ ,Xs of cardinalities ni,n 2 . ■ ■ ,ns. Write Xi = {xj,... ,a;"4> 
and suppose that the edge {xj,xf} has weight 1 for all 2 < i < s and that the edge 
{x^,xj} has weight 1 for all 1 < i < s — 1. Then we can patch these solutions into 
a feasible solution on ULi by 

(a) Deleting all the edges {xl,x^} {2 < i < s) and {xf,xf} (1 < i < s — 1) 

(b) Joining x^ to xLi md xf to x'f_^_i with edges of weight 1, for all 1 < i < s — 1 

Proof. By induction on s, it suffices to handle the case s = 2. (For s > 2, first patch 
together the Xi,..., Xs_i, to obtain a Held-Karp solution containing {xl,xl} and 
with weight 1 , then patch this with Xg.) 

It is apparent that after the patching operation, the degree of every vertex is still 
2. Suppose now that 0 C S' C U X 2 and write Si = S D Xi for i = 1,2. 
Let 0 < p < 2 denote the number of edges from among {xi,xl}, {xf,X 2 } whose 
endpoints are both in the set S. Note that 

S(S) = S(Si)+S(S2) + p. 

We let S{Si) denote the weight of the HK instance on Si before the patching 
operation (of course, S{Si) < S{Si)) and consider cases: 

Case 1: p = 0 

We have Si 7 ^ Xi for some i € {1, 2}. For at least one i, Si ^ 0, giving 

S(S) < s(Si) + 5 ( 82 ) < 4il + 42I -1 = 41 -1- 

Case 2: p = 1 

Now p < 2 implies that Si 7 ^ Xi or S 2 7 ^ -^ 2 , while p > 0 implies that Si 7 ^ 0 and 
S 2 7 ^ 0. If in fact both Si 7 ^ Xi and S 2 X 2 , then we have 

5(s) < s(Si) + s(S2) +1 < 4i| -1 + 42I -1 +1 = 41 -1. 

while if (without loss of generality) Si = Xi and S 2 ^ X 2 , then the deletion step 
(a) implies that S(Si) = S(Si) — 1, and so we have 

S(S) < s(Si) -1 + S(S2) +1 < 4il -1 + 42I -1 +1 = 41 -1- 
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Case 3: p = 2 

In this case, we have that the deletion step (a) implies that S{Si) < S{Si) — 1. In 
particular, we have 

SiS) < S{Si) - I + S{S 2 ) -1 + 2 = S{Si) + 5 ( 82 ) 

and thus 5(5') < |5| —1 if 0 7 ^ 5 7 ^ X 1 UX 2 , since we must have either 0 ^ Si ^ Xi 
or 0 7 ^ 52 7 ^ ^2 {Si = 0 is not possible since p > 0 ). □ 

Next we prove a concentration lemma, which is a simple modification of what 
appears in [27]. This will allow us to argue that modest conditioning does not 
significantly alter the value of LB in the leaves of interest. 

Let J C {I,..., n} be any fixed set of indices, and let Xj C Xn = {xi,X 2 , ■ ■ ■, Xn} 
denote the random set {xj \ j G J}. Recalling I from (24), we let 

LB,(J) =LB((R,nXj)\I). 

In particular, LBi( J) is the value of LB(Xj) restricted to the box Bi, after throwing 
away the (two or four) special points x ^. 

Lemma 5.3. For LB(J) = X]iLBi(J), where LB = TFg or HK, and |J| = ry, we 
have 

(a) ELB(J) < + o(ry('^-i)/^). 

(b) There is an absolute constant c > 0 such that, 

Pr(LB( J) > E LB( J) + 0 < exp |- } . 


Proof. 

(a) Suppose that Xj n Bi contains points for i = 1, 2,..., s. Then, in particular, 
Xj n i?i \ I is rii — 2 or Ui — 4 for each i, and we have 


s Q {d—l)/d 

ELB(J)<(I4-«(1))1:*£L 

where we have used Jensen’s inequality. 




(d-l)/d 


(b) Assume first that LB = TFg. Let J = {ji, J2, ■ • • Let 

d, = max |LB({ji, j2, ■ • ■, Jo ■ • -Jv) “ LB({ji, jz,.. •, Ju ■ • ■, Jt,})!- 

jiji 

The Azuma-Hoeffding inequality implies that 

(27) Pr(LB>ELB + t)<exp|-^^A_|. 


Now fix ji,j 2 ,. •. ■,jrj,ji and suppose that ji G Bk and ji G Bi. Then let 

^ = |LB(ji,j2,.--,jD---,jT,) -LB(ji,j2,--.,jD---,jT,)|- 
Let F = Fi \J ■ ■ ■ \J Fg where F) is the optimal 2-factor for {Xj n Bj) \ X, for 
f = 1, 2,..., s. Suppose that the neighbors of x = Xj. on its cycle C in F are y, z. 
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If \C\ = g then we cannot simply delete x and replace the path {y, x, z) by {y, z) as 
this will produce a 2-factor of girth g — 1. So, let a be the closest point in to x 
that is not on C and let 6 be a neighbor of a on the cycle C of F that contains a. 
The first thing we do now is to delete x and merge the points in C U C" \ {a;} into 
one cycle. We delete the edges {x, y} , {x, z} , {a, b} and add the edges {y, a} , {z, b}. 

The change in cost is at most ^ at log « ^ ^ cycle has length at least 

2g — 1 > g. After this we can insert x = Xi into the cycle D, say, of Fi that contains 
the point c of xi,...,Xi_i,Xi+i, ... ,x„ closest to x. Thus, 

(28) cl.<4d'/A^T 

This proves the lemma for LB = TFg. 

Assume now that LB = HK. Here we can use the results of Goemans and Bertsimas 
[13]. Proposition 3 and Lemma 11 of the same paper shows that di <2 min^^i \xi — 
Xj I and the proof goes through as before. □ 


We are ready to proceed with the proof of Theorem 1.9. We consider the sizes of 
A and A„ = A n A„. We let j3j = \Xn^Bj\. Then we have that 


A = (/3i 


2)! I n(/5i-3)! a 


U=2 


2 )!. 


Now, given we let C denote those edges in whose endpoints both lie 
in Bj, and /' C denote those edges in ly of the form {x^,x^_^_l} or {xj,Xj^i}. 
Given we let Ol denote the set of those edges in Oy whose endpoints both lie in 
Bj and are not equal to the edges {x^, x'j}, {x^, Xj}, {xj, x'j}, or {x^Xj}. Observe 
now that A = 0 unless /„ = /' U Uj=i H = Uj=i We now observe that 


(29) A„ <(/?!-2-|/i|)!2l^^l I (/3, - 2 - |/«|)2l^^l! 

Vi=2 

and, letting (5^ = 1 when |A| > 1, and 0 otherwise: 

(30) 

A„ < (/3i - 2 - Soim - 3)! [lliPj-i- SoOiPj - 4)! ]iPs-2- 5os)(A - 3)! 

\i=2 


since, e.g., the number of ways of covering with paths from 1 to 3 and 2 to 4, 
respectively, while avoiding an edge e which is not {1, 2}, {3,4}, {1,4}, or {2,3} is 
exactly either {(3 — 3)! — {fi — 4)! or {fi — 3)! — 2(6 — 4)!, depending, respectively on 
whether or not e is incident with a vertex in {1, 2, 3,4}. 


Observe now that the Ghernoff bounds give that w.h.p all /3j’s satisfy fij < 2 AT log n. 
In particular, there must be at least \Oy\ (2KToFn) (30) 
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gives 

(31) 


A„ < A • 1 - 


1 


10 . 1(3 




< g-|0„|/(2Klogra) 


2K\ogn^ 

where = U^=i Oi- Also, for L = ljj=i (29) gives (very crudely) that, say, 
(32) A„<A-(|)-I^''I <A-e-l^”l 


Now (31) and (32) establish that large /„ or forces A„ to be small. (Note that 
this part of the argument would have failed if we were working with A„’s in place 
of A„’s.) Thus, defining L = {v G L \ ^ 0}, we have that A = 1J«6L V- In 

particular, since A is large, we can show that the set of leaves L of the branch and 
bound tree must be large (in fact, that L C L is large) by showing that v G L 
implies that either or Oy is large. This is where we use separation of constants. 
Indeed, we will prove: 

Lemma 5.4. Let LB be TF^ or HK. We have w.h.p. that for all v G L, either 
\iv\ + \Oy\ > or else that 

1 

LBiXy\Iy,Oy) < +C(|L| T lai) +o(n^) 

for some constant C. 


To use the lemma to complete the proof of Theorem 1.9, we observe that v G L 
implies that LB(A’„) > TSP)^^!) and thus, from Lemma 5.4, that either |L| + |Ot,| > 
I " , or else that 

log^ n ’ 

|4| + |0„| > (/3tsp - - o {^) 

In particular, in the latter case, the separation /3tsp > /3lb gives that 

\U + \Ov\ = n{^) 

In either case, (31) and (32) will then give that 

Ay < 


and thus that 




completing the proof of Theorem 1.9. 


□ 


Proof of Lemma 5.4- We have that v G L implies that ly = LyU ly and Oy = Oy. 
Now let 

y:=Xn\ {V{Iy) \ V{Oy)). 

Letting J{X) for AT C denote the set such that X = {xj \ j G J{X)}, Lemma 
5.3 gives that 


(33) Pr C A’„, 1^1 < : LB( J(A’„ \ 5)) > (/3lb + 

V log n J 

n \ f c£‘^n‘^ 0 -'^)/d 

Gn/ log^ n) \ 4„(d-2)/<i \og^/d 


< 


< exp 


log^ n 4 log^/'^ n 


= 0 ( 1 ). 
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Thus, taking, e.g., e = we have that w.h.p all leaves v £ L satisfy 

LB(J(J^:)) < 

Recall that an instance of LB consists of independent instances Hj of LB in each 
BjCiynX^- Now, for each j, we patch into Hj the edges in lyCiEj (using Operation 1 
of Lemma 5.1, if LB = HK), the endpoints of the edges in (5„ n Bj (using Operation 
2 of Lemma 5.1, if LB = HK), and the (one or two, depending in j) edges 

{xj,x^} for 2 < j < s 
{x^, a;|} for 1 < j < s — 1 


(again using Operation 1, if LB = HK). In the case where LB = TF^ the patching 
is simply accomplished by rerouting cycles through the edges and points. Note 

l/d 

, the total increase in cost 


K log n 


that, as the squares Bj have diameter Vd ^ 
due to this patching is < C^dLl + |0„|) ^ '°s » ^ Jq,- gome constant Cd- 


Next, we patch the resulting solutions together at the points xl to a global instance 
of the LB (using Lemma 5.2, if LB = HK). It is important to note that by beginning 
with an instance of LB and then patching to this particular instance of LB, we are 
guaranteed this instance includes any edges in . Since there are O(p^) squares of 

diameter the total increased cost from this patching is 

and so we have shown that 

(34) LB(A’„|L U I'y,Oy) < HQ{yi) + Cd{\h\ + |0„|) 

< + Cd{\iy\ + lai) + o(n(''-i)/‘'), 

as desired. □ 


6. Final Remarks 

Our results lead to many natural directions of inquiry, and here we mention just 
a few. Apart from simply increasing the list of separated pairs of constants, the 
following seems like a very good challenge: 

1. What is the relationship between /Smst; /^tF) 

In connection with Theorem 1.4: 

2 . The minimum length of covering of Xn by paths of lengths at least A: is a 
Euclidean functional; let /3p ^ denote the constant in its asymptotic formula. Is it 
true that lim = /3 tsp'^ 

k—¥(X) ’ 


Short of a full confirmation of Conjecture 1.6, one could warm up with some special 


cases: 
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3. Pick an integer k, and then prove or disprove that distinct unlabeled trees T on 
k vertices have distinct asymptotic constants /3|.. 


Finally, we note that as our methods for separating constants give only very small 
differences, we have not attempted to calculate lower bounds on, say, /?xsp ~ Phkj 
or optimize our techniques for this purpose, though this project could be pursued. 
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